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Abstract. Games on graphs provide a natural and powerful model for reactive 
systems. In this paper, we consider generalized reachability objectives, defined as 
conjunctions of reachability objectives. We first prove that deciding the winner in 
such games is PSPACE-complete, although it is fixed-parameter tractable with 
the number of reachability objectives as parameter. Moreover, we consider the 
memory requirements for both players and give matching upper and lower bounds 
on the size of winning strategies. In order to allow more efficient algorithms, we 
consider subclasses of generalized reachability games. We show that bounding 
the size of the reachability sets gives two natural subclasses where deciding the 
winner can be done efficiently. 



1 Introduction 

Graphs games. Our purpose is to study reactive systems by abstracting them 
into graphs games: a state of the system is represented by a vertex in a finite 
directed graph, and a transition corresponds to an edge. If in a given state, the 
controller can choose the evolution of the system, then the corresponding vertex 
is controlled by the first player, Eve. Otherwise, the system evolves in an un- 
certain way: we consider the worst-case scenario where a second player, Adam, 
controls those states. To a run of the system corresponds a play on the game: we 
put a pebble in the initial vertex, then Eve and Adam move this pebble along the 
edges, constructing an infinite sequence. The specification of the system gives 
an objective Eve tries to ensure on this sequence. In order to synthesize a con- 
troller, we are interested in two questions: whether Eve wins in the game, and 
what resources are needed to construct a winning strategy (see HGTWOH for 
more details). 

System specifications. To specify properties of a system, we construct a set 
of infinite sequences representing the correct behaviors of the system. From 
an infinite sequence we extract finite information to decide whether the run 



it represents meet the specification. For instance, considering the set of ver- 
tices visited infinitely often allows to specify the classical w-regular proper- 
ties, e.g Buchi, parity, Streett, Rabin and Muller objectives. Other informations 
can be carried out, as for instance the set of vertices visited with positive fre- 
quency HTBG091 , or the order in which the vertices are visited for specifying 
LTL objectives MKPV07IHTW08IZiml 11 In this work, we observe the set of 
vertices visited at least once, which allows to specify reachability objectives, 
also called weak objectives HNS W02IS W74IMos9 1 IKVWOOl 

Generalized reachability objectives. The (simple) reachability objective re- 
quires, given a subset of vertices F, that a vertex from F is reached. Reacha- 
bility objectives only specifies that one property (represented by F) is satisfied 
along the run. We allow more properties to be specified by using generalized 
reachability objectives, defined as conjunctions of k reachability objectives. In 
this context, a reachability objective is often referred as a color: a generalized 
reachability objective is then to see each of the k colors at least once. 

2 Definitions 

The games we consider are played on an arena A = (V, (V , Vb), E), which 
consists of a finite graph (V, E) and a partition (V , Vb) of the vertex set V: a 
vertex is controlled by Eve if it belongs to V and by Adam if it belongs to Vb . 
Vertices from V are depicted by a circle, and vertices from Vb by a square. We 
denote by n the number of vertices and m the number of edges. Playing consists 
in moving a pebble along the edges: the pebble is placed on the initial vertex vq, 
then the player who controls the vertex chooses an edge and sends the pebble 
along this edge to the next vertex. From this infinite interaction results a play 
7r, which is an infinite sequence of vertices vq, v±, . . . where for all i, we have 
(vi,Vi + i) G E, i.e tt is an infinite path in the graph. We denote by il the set 
of all plays, and define objectives for a player by giving a set of winning plays 
C II. The games are zero-sum, which means that if Eve has the objective <P, 
then Adam has the objective II \<P (the objectives are opposite). Formally, a 
game is given by a couple Q = (A, <£) where A is an arena and <P an objective. 

A strategy for a player is a function that prescribes, given a finite history of 
the play, the next move. Formally, a strategy for Eve is a function a : V* ■ V — > 
V such that for a finite history w € V* and a current position v £ V , the 
prescribed move is legal, i.e along an edge: (v,a(w ■ v)) € E. Strategies for 
Adam are defined similarly, and usually denoted by r. Once a game Q = (A, ^), 
a starting vertex vq and strategies a for Eve and r for Adam are fixed, there is 
a unique play denoted by 7r(uo, a, r), which is said to be winning for Eve if it 
belongs to <I. The sentence "Eve wins from vq" means that she has a winning 
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strategy from vq, that is a strategy a such that for all strategy r for Adam, the 
play ir(vo,a, r) is winning. The first natural problem we consider is to "solve 
the game", that is given a game Q and a starting vertex vq, to decide whether Eve 
wins from vq. We denote by We{9) the winning positions of Eve, that is the set 
of vertices from where Eve wins (also referred as winning set), and analogously 
Wa(G) for Adam. We can prove that in generalized reachability games, we have 
We(A, ^) U Wa{A, <P) = V: from any vertex, either of the two players has a 
winning strategy. We say that the games are determined. 

The strategies as defined in their full generality above are infinite objects. 
Indeed, in this general setting, to pick the next-move, Eve considers the whole 
history of the play, whose size grows arbitrarily. A nicer setting, giving rise to 
finitely-representable objects, is to define strategies relying on memory struc- 
tures. Formally, a memory structure M. = (M, mo, /i) for an arena A consists 
of a set M of memory states, an initial memory state ttiq £ M, and an update 
function \i : M x E — > M. A memory structure is similar in fashion to an au- 
tomaton synchronized with the arena: it starts from iuq and reads the sequence 
of edges produced by the arena. Whenever an edge is taken, the current state is 
updated using the update function \i. A strategy relying on a memory structure 
M., whenever it picks the next move, considers only the current vertex and the 
current memory state: it is thus given by a next-move function v : V x M — > V . 
Formally, given a memory structure M. and a next-move function v, we can de- 
fine a strategy a for Eve by a(w ■ v) = u(v, n*(w • t?)). (The update function 
can be extended to a function fj,* : V + — > M by defining fJ,*(v) = tuq and 
H*{w ■ u ■ v) = fi(n*(w ■ u), (u, v)).) A strategy with memory structure M. has 
finite memory if M is a finite set. It is memoryless, or positional if M is a single- 
ton: in this case, the choice for the next move only depends on the current vertex. 
Note that a memoryless strategy can be described as a function a : V — > V. 

We can make the synchronized product explicit: an arena A and a memory 
structure M for A induce the expanded arena A x M. = (V X M, (V X M, Vu X 
M), E x fj,) where E x \i is defined by: ((v, m), (v' , m')) G E' if (v, v') <E E 
and ju(m, (v, v')) = m'. There is a natural one-to-one mapping between plays 
in A and in A x M., and also from memoryless strategies in A x A4 to strategies 
in A using A4 as memory structure. It follows that if a player has a memoryless 
winning strategy for the arena A x M., then he has a winning strategy using M 
as memory structure for the arena A. This key property will be used later on. 

A reachability objective requires that a vertex from a given subset of vertices 
F is reached: Reach(F) = {vo, i>i, i>2 ■ ■ ■ I 3p € N, v p € F} C II. Games 
in the form Q = (A, Reach(F)) are called reachability games. To determine 
whether Eve wins a reachability game, we compute the reachability set attractor. 
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We define the sequence (Attr;(F))i>o: 



Attro(-F) = F 

Attr i+1 (F) = Attri(F) U {u G V \ 3(u,v) G E, v G Attr^F)} 

U {u G Vb | V(u, w) 6 E, v G Attri(F)} 

Then Attr(F) is the limit of the non-decreasing sequence (Attrj(F))j>o- We 
can prove that We (A, Reach (F)) is exactly Attr(F). 

Generalized reachability objectives. A generalized reachability objective re- 
quires that each of the given k subsets of vertices F\ , . . . , F^ is reached: 

GenReach(Fi, . . . , F k ) = {ir \ Vi, 3pj G N, v Pi G FJ. 

Associating to each reachability objective a color, we can reformulate the gener- 
alized reachability objective: it requires to see each of the k colors at least once, 
in any order. Games in the form Q = (A, GenReach(Fi, . . . , Fk)) are called 
generalized reachability games. The special cases where in A, Va (respectively 
Vo) is empty are called one-player (respectively opponent-player) generalized 
reachability games. 

Example 1. We consider the arena drawn in FigureQ] A generalized reachability 
game is defined by the objective GenReach({l, 2}, {3}). The central vertex is 
the initial one. Eve tries to visit one of the two thick vertices and the dashed 
vertex. 




Fig. 1. An example of a generalized reachability game 



Contributions. Our contributions are as follows: 
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- We first prove that deciding the winner in generalized reachability games 
is PSPACE-complete. Using the same ideas, we also show that the one- 
player restriction, where all vertices belong to Eve, is NP-complete, and 
that the opponent-player restriction, where all vertices belong to Adam, can 
be solved in polynomial time. On the positive side, it is fixed-parameter 
tractable (FPT) with the number k of colors as parameter. 

- We study the size of the winning strategies for both players: we prove match- 
ing upper and lower bounds, i.e in any arena, if Eve has a winning strategy, 
then she has a winning strategy that uses 2 k — 1 memory states, and there is 
an arena where Eve wins but there are no winning strategies with less than 
2 k — 1 memory states, and similarly for Adam with the bound (i fe / 2 j) - 

- We then consider the subclasses where we restrict the number of vertices 
sharing the same color (in other words, the size of reachability sets). This re- 
veals a trichotomy: if three vertices are allowed to share the same color, then 
deciding the winner is, as in the general case, PSPACE-complete. However, 
if each color appears only once, then the problem is polynomial. If each 
color appears only twice, then the problem is polynomial for one-player 
games, where Eve controls all vertices. 

Outline. In section 3, we first study the complexity of solving generalized reach- 
ability games, for two-player and one-player games, and then give matching up- 
per and lower bounds for the memory required. In section 4, we consider the 
subclasses of games where the size of reachability set is restricted, in order to 
find tractable subclasses. 

3 The complexity of generalized reachability games 

In this section we prove that the winner problem in generalized reachability 
games is PSPACE-complete. Our PSPACE-hardness result follows from a re- 
duction from QBF (evaluation of a quantified boolean formula in conjunctive 
normal form). However, we show that solving generalized reachability games 
with few colors is easy, as it is fixed-parameter tractable using the number of 
colors as parameter. 

We then study one-player restrictions. We prove that the one-player gen- 
eralized reachability games are NP-complete. The other one-player restriction, 
opponent-player generalized reachability games, can be solved in polynomial 
time. 

The last subsection investigates memory requirements for both players. We 
present matching upper and lower bounds: Eve needs 2 k — 1 memory states and 
Adam ( , . /9 ,) , where k is the number of colors. 
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3.1 PSPACE-completeness of solving generalized reachability games 

As a first step we define a reduction from QBF to the winner problem of gener- 
alized reachability games. Consider a quantified boolean formula 

QlXi Q2X2 ■ ■ ■ Q n X n 4> , 

where is a prepositional formula in conjunctive normal form, i.e 

4> = A *M V V ... V iij. 
i<k 

and lij is either X{ or ->Xi for some i < n. We construct a generalized reach- 
ability game where Eve wins if and only if the formula is true. Intuitively, the 
two players will sequentially choose to assign values to variables, following the 
quantification order and starting from the outermost variable. Eve chooses ex- 
istential variables and Adam chooses universal variables. Formally, the game is 
as follows: 

- for each variable Xi, there are two vertices, Xi and xj; 

- for each variable xi, there is a choice vertex which leads to Xj and x[. The 
choice vertex belongs to Eve if Xi is existentially quantified, and to Adam if 
X{ is universally quantified; 

- for each variable x% with i < n, there are two edges from x% and ~xi to the 
next choice vertex Uj+i; 

- there is a sink s, and two edges from x n and x^ to s; 

- for each clause {^,1, . . . ,^ij t }, there is a reachability objective Fi which 
contains the corresponding vertices; 

- the generalized reachability objective is given by GenReach(Fi, . . . , FjS). 

The initial vertex is v±. There is a natural bijection between assignments of 
the variables and plays in this game; and an assignment satisfies the formula 
4> if and only if the play satisfies the generalized reachability objective. The 
evaluation order of the variables being the same in the formula and in the game, 
we conclude that Eve has a winning strategy if and only if the formula is true. 

Example 2. We consider the following quantified boolean formula 

Vx By Vz (x V ->y) A (->y V z) . 

Figure [2] shows the game built by the reduction. The generalized reachability 
objective is Reach({x, y}) A Reach({y, z}). Thick vertices represent the first 
reachability objective and dashed vertices the second one. 
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Fig. 2. An example of the reduction from QBF to generalized reachability games. 



Theorem 1 (Complexity of generalized reachability games). Solving gener- 
alized reachability games is PSPACFi-complete. 

Proof. The previous reduction implies the PSPACE-hardness. 

Let us first make a simple observation: if Eve has a winning strategy, then 
she has a winning strategy that visits each reachability set within n • k steps. 
Indeed, if she can enforce to visit a subset of vertices, then she can enforce it 
within n steps. 

Relying on this remark, we can simulate the game for up to n ■ k steps 
using an alternating Turing machine: whenever a vertex belongs to Eve, the 
corresponding state is disjunctive, and it is conjunctive if the vertex belongs to 
Adam. A path of length n ■ k is accepted if it is winning, i.e if it contains one 
vertex from each reachability set Fj. This machine accepts if and only if Eve 
wins, and works in polynomial time. Since APTIME = PSPACE, the result 
follows. I 

3.2 Parameterized complexity 

Solving generalized reachability games with few colors is easy: 

Theorem 2 (Generalized reachability games with k colors). Solving gener- 
alized reachability games is fixed-parameter tractable (FPT) with the number 
of colors as parameter. 

Roughly speaking, the only information needed during a play is the subset 
of reachability sets already visited. We build a memory structure that keeps track 
of this information. By constructing the product with this memory structure, we 
turn a generalized reachability game into a (classical) reachability game. 

Proof. We consider Q = (G, GenReach(Fi, . . . , Fj.)) a generalized reacha- 
bility game, and vq a starting vertex. The memory structure A4 is defined by 
(2^ 1 >-' k ^, mo, p), where tuq is {i \ vq G Fj}, and /j,(S, (v, v')) = S U {i \ v' € 
Fi}. Let F = {(_, S) | S = {1, . . . , A;}}: a play for the generalized reachability 
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game Q from vq is winning if and only if it is winning for the reachability game 
Q x M. = (G x M, Reach (F)) from (vo,mo). Since deciding the winner in 
a reachability game can be done in linear time using an attractor computation, 
solving a generalized reachability game can be done in time 2 k x 0(n + m). I 

3.3 Solving one-player restrictions 

Theorem 3 (One-player restrictions). Solving one-player generalized reacha- 
bility games is NP -complete. Solving opponent-player generalized reachability 
games is polynomial. 

Proof. We first deal with one-player generalized reachability games, where Eve 
controls all vertices. In our previous reduction, consider the case where all vari- 
ables in the original formula are quantified existentially. Then the problem corre- 
sponds to SAT (satisfiability of a boolean formula in conjunctive normal form), 
which is NP-complete. Resulting games are one-player games, i.e all vertices 
belong to Eve, hence solving one-player generalized reachability games is NP- 
hard. 

We describe a non-deterministic algorithm to solve these games in polyno- 
mial time. As noted before, if Eve wins, then she has a winning strategy that 
wins within n ■ k steps. The algorithm guesses a path of length n ■ k and checks 
whether it is winning. It follows that solving one-player generalized reachability 
games is NP-complete. 

We now consider opponent-player generalized reachability games, given 
by the objective GenReach(Fi, . . . The winning set for Adam is V \ 
Hi Attr(Fj), which can be computed in quadratic time. I 

3.4 Memory requirements 

We first present upper bounds: 

Lemma 1 (Memory upper bounds). For all generalized reachability games 
G = (G, GenReach(Fi, . . . , F k )), 

- if Eve wins, then she wins using a strategy with memory 2 k — 1; 

— if Adam wins, then he wins using a strategy with memory ( \j^/2\ ) • 

As in the proof for FPT membership, we make use of the memory struc- 
ture M = (2^ 1 '---' k \mo, fi), where tuq is {i \ vq G Fi}, and p,(S, (v,v')) = 
S U {i | v 1 G Fi}. Setting F as {(_, S) \ S = {1, . . . , A;}}, a play for the gen- 
eralized reachability game Q from vo is winning if and only if it is winning for 
the reachability game Q x M. = (G x A4, Reach(F)) from (vq, mo). 
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Proof. We consider Q = (G, GenReach(Fi, . . . , F^)) a generalized reachabil- 
ity game, and vq a starting vertex. Since in the reachability game Q x M., each 
player has memoryless winning strategies, each player has in Q a winning strat- 
egy using M. as memory structure. 

The memory set of M. has size 2 k . In order to get the correct bounds for 
each player, we rely on two observations. 

- Eve does not need a specific memory state to remember that all colors have 
been reached, as in this case, she has already won. Thus, she can always win 
with 2 k — 1 memory states. 

- If Adam wins in Q x A4 from (v, S) and S' C S, then he wins from (v , S') 
using the same strategy. Consider v € V a vertex in Q, and the set of sub- 
sets S such that (v, S) belongs to Adam's winning set. Its maximal (with re- 
spect to inclusion) elements are incomparable, so there are at most ( ) , 
we denote them by Si(v), . . . , S p (v). The idea is that from v, there are 
only p different options Adam has to consider, namely Si(v), ... , S p (v). 
Indeed, for any S such that (v, S) is winning for Adam, there exists an i 
such that S C Si(v), so Adam can forget S and assume the current position 
is (v,Si(v)). 

We define a memory structure on the memory set {1, ... , (i fe / 2 j) }• ^ e aun 
at constructing a strategy that will ensure that after a finite play it ■ v, the 
memory state is an i such that Si(v) contains the set of visited colors. If 
the initial vertex is vq, the initial memory state is an zq such that Si (vo) 
contains ttiq. We define the update function: fj,(i, (v,v')) is a j such that 
Sj(v') contains fi(Si(v),v') = Si(v) U {i \ v' £ Fj}. 
Let us turn to the next-move function. Consider (v, S) in Adam's winning 
set, then there exists a transition to some (v',S') also in Adam's winning 
set. Applying this to (v, Si(v)) such that S C Si(v), we get a vertex v', and 
define v(v, i) to v' . Playing this strategy, the above invariant is satisfied, and 
thus ensures to stay forever in Adam's winning set, so it is winning. The 
memory set contains (i fc / 2 |) memor y states. 



Lemma 2 (Memory lower bounds for both players). For all k, 

- there exists Q = (G, GenReach(Fi, . . . ,Fk)) a generalized reachability 
game, where Eve needs 2 k — 1 memory states to win; 

— there exists Q = (G, GenReach(i*i, . . . , i*fc)) a generalized reachability 
game, where Adam needs ( , , ,„ , ) memory states to win. 
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Fig. 3. A generalized reachability game where Eve needs 2 k — 1 memory states to win 



Proof. We first describe a generalized reachability game where Eve needs 2 k — 1 
memory states to win. This example was proposed in BCHH11I in a similar 
framework. The arena is shown in Figure |3j for k = 5. A vertex labelled by i 
belongs to Fj, and a vertex labelled by i has all colors but i. A play starts from 
the heart h; first Adam chooses a petal i, then Eve chooses either to reach color 
i before going back to the heart (the play goes on), or to reach every colors but 
% and to stop the play. Eve wins with the following strategy: the first time Adam 
chooses the petal i, she goes back to the heart; the second time, she stops the 
play. This strategy uses 2 k memory states. She can save one memory state by 
dropping the memory state corresponding to the case where she saw each petal, 
as it is winning for her. However, we show that there is no winning strategy for 
Eve with less than 2 k — 1 memory states. Let a a strategy using the memory 
structure A4 with less than 2 k — 1 memory states, and v its next-move function. 
For each memory state m, we consider S m = {i \ u(vi,fi(m, (h,Vi))) = i}, 
the set of petals where Eve would stop the play if Adam chose them. As there 
are less than 2 k — 1 memory states, there is a strict subset X of {1, . . . , k} 
which is not the stopping set of any memory state. Adam can win against a by 
choosing, at each step, a petal in the symmetric difference of X and S m , where 
m is Eve's current memory under a. (Indeed, if Adam plays forever in X, then 
Eve will never stop the play and only colors from X will be reached, otherwise, 
whenever Eve stops the play, the last memory state from the heart was an m 
such that X C S m , and the petal chosen is an i that belongs to S m \ X, hence 
that has never been reached.) 

We now describe a generalized reachability game won by Adam, where he 
needs (^ 2 j) memor y states to win. Let k = 2p + 1. A play consists in three 
steps: first Eve chooses p colors, then Adam chooses p colors, and third Eve 
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chooses p colors. In order to win, Adam must visit exactly the same colors Eve 
visited (which requires (^,/ 2 j) memory states), otherwise at least p + 1 colors 
have been visited when Eve plays for the second time, and she can choose and 
visit the remaining colors that have not yet been visited. I 

4 Restrictions on the size of reachability sets 

The above section shows two different directions which make generalized reach- 
ability games hard: the first is the complexity of solving generalized reachability 
games (PSPACE-complete), and the second is the memory required to con- 
struct winning strategies for both players (exponential in the number of colors). 

In this section, we restrict the size of the reachability sets in order to find 
tractable subclasses of generalized reachability games. 

Notice that our reduction from QBF only implies PSPACE-hardness when 
reachability sets have size at least three. Indeed, note that in the reduction, the 
size of a reachability set in the generalized reachability game corresponds to the 
size of the corresponding clause of the formula. Since the problem of evaluating 
a quantified boolean formula is polynomial if the formula has two variables per 
clause, our reduction does not imply the PSPACE-hardness of solving gener- 
alized reachability games with reachability sets of size one or two. This remark 
motivates our study of the subclasses of generalized reachability games where 
each color appears once, and then where each color appears twice. 

4.1 Reachability sets of size one 

The case where reachability sets are singletons is polynomial: 

Theorem 4 (Generalized reachability games where reachability sets have 
size 1). Solving generalized reachability games where reachability sets are sin- 
gletons is in PTIME. 

Proof. We denote by Vi the only vertex in Fi, for all i. In this case, the gener- 
alized reachability objective can be expressed by f\ i<k Reach(-Uj). We will see 
that Eve wins if and only if the preorder defined by v < v ' if v € Attr(u') is 
total. Intuitively, it means that a winning strategy prescribes: "reach i>/m, then 
Vf(2), and so on", where / is a permutation over {1, . . . , k}. 
We consider two cases: 

- If the preorder < is total over {vi | 1 < i < k}, then we show that We, 
set of winning positions for Eve, is njAttr(u j). Let v £ rijAttr(f j) and / 
a permutation over {!,...,&} such that for all 1 < i < k — 1, we have 
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v fu\ € Attr(-yj(j +1 )), we construct a winning strategy from v that reaches 
Vffu, then u/(2)> and so on. Note that this strategy only needs k memory 
states. Conversely, if v ^ njAttr(uj), then Eve cannot win, as Adam can 
prevent her from reaching some reachability set. 
- If the preorder H is not total, then there exist V{ and vj such that v i ^ 
Attr(fj) and Vj ^ Attr(uj). In this case Adam wins from everywhere, fol- 
lowing the strategy "if Vi or Vj has been reached, then avoid the other". Note 
that this strategy only needs 2 memory states. 

Checking that the preorder ^ is total can be done in polynomial time. I 

Note that as a corollary, we get memory upper bounds in this case: Eve 
needs at most k memory states and Adam at most 2. It is not difficult to see that 
these bounds are tight. 

4.2 Reachability sets of size two 

Let us now turn to the case where reachability sets have size two. We first ex- 
tend the technique used for the previous case: it was stated that "Eve wins if and 
only if there is a total order on colored vertices". A similar approach works for 
one-player arenas, through a reduction to the satisfiability problem of boolean 
formulas where clauses have size two. (This latter problem is known to be de- 
cidable in polynomial time.) 

Theorem 5 (Generalized reachability one-player games where color ap- 
pears twice). Solving generalized reachability one-player games where reach- 
ability sets have size two is in PTIME. 

Proof. As in the previous subsection, we consider the preorder defined by v < 
v ' if v € Attr(V). Note that in the case of one-player arenas, v € Attr(f') 
reduces to "there is a path from v to v'". 

Let Fi = {xi,yi} be the reachability sets, and vq be a starting vertex. We 
assume without loss of generality that there is a path from vq to every Fi (that is, 
either to Xi or yi), otherwise Eve cannot win. (This property is easily checked in 
deterministic polynomial time.) A first statement is as follows: Eve wins from 
vq if and only if there exist v±, . . . ,Vk colored vertices such that 

1. for all < i < k — 1, Vi -< V{ + ± and 

2. each color appears in {v\, . . . , v^}. 

We turn this condition into a boolean formula where clauses have size 2. We 
consider the 2 • k variables Xi and Yi, that correspond to vertices xi and yi. We 
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define the formula 0: 



(a) (b) 

where x, y ranges over colored vertices (that is, vertices from Fi for some i). 

We argue that Eve wins from vq if and only if <fi is satisfiable. Assume Eve 
wins from vo'. let v±,. . . ,Vk as in the previous statement, and set the corre- 
sponding variables to true and the others to false, we claim that the formula <fi 
is satisfied. Indeed, condition 2. ensures that the clauses under-braced (6) are 
satisfied, and for the clauses under-braced (a), let x,y such that x ^ y and 
y -f± x, if x is one of the Uj's, then y cannot be, so ->X V -Y holds. Conversely, 
assume that <p is satisfiable. The clauses under-braced (a) ensures that the order 
^ is total over vertices set to true. The clauses under-braced (6) ensures that at 
least one vertex from each reachability set is set to true. Combining those two 
statements, we reach the condition stated above. 

The latter allows to decide in polynomial time whether Eve wins from vq by 
checking the formula <fi for satisfiability. I 

We do not know the exact complexity of generalized reachability games 
where reachability sets have size 2. In the remaining of this subsection, we dis- 
cuss this question, focusing on memory requirements for both players. 

The memory required for Eve is still exponential, as shown in Figure |4] for 
k = 4. Specifically, it shows a generalized reachability game where reachability 
sets have size 2 won by Eve, where she needs 2L fc / 2 J +1 — 1 bits of memory to 
win. The arena is divided into two parts: the left hand side is a flower with [k/2\ 
petals, and the right hand side a one-player arena. The game starts at the heart 
of the flower. First, Eve asks for each petal a color. Once this task is completed, 
she can move to the right hand side to reach the remaining colors. Eve needs 
to remember the [k/2\ choices made by Adam (one for each petal), in order to 
reverse them: if Adam chose the color 1, then the color 2 has not been reached, 
so Eve has to choose color 2. Remembering those choices and asking for each 
petal requires 2L fc / 2 J +1 — 1 memory states. (This is the size of the complete 
binary tree of depth [k/2\.) 

On the other hand, the exact memory required for Adam remains open. The 
figure [51 following an idea of Christof Loeding, shows a generalized reacha- 
bility game where reachability sets have size 2 won by Adam, where he needs 
4 memory states to win. The game starts from the left hand side vertex. First 
Eve chooses and visits three of the four colors (two colors in the first column, 
1 and 2 or 3 and 4, and then one in the second column), and sends the pebble 
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Fig. 4. A generalized reachability game where Eve needs 2^ fc ' /2 J +1 — 1 memory states to win 

to the right hand side vertex, controlled by Adam. There, he has four options, 
each allowing all colors but one. Remembering the four possibilities requires 
four memory states, and leads to a win. However, with less memory states, one 
of the four option will never be played, and Eve wins. 

Quite surprisingly, we could not generalize this example to obtain a better 
lower bound than 4. We do not know whether this bound is tight (in any arena, 
if Adam wins, then he has a winning strategy with 4 memory states), which 
is plausible. Note that this would imply a coNP NP algorithm: guess a winning 
strategy for Adam with 4 memory states, and compose this strategy with the 
game, then solve the resulting one-player game. 




Fig. 5. A generalized reachability game where Adam needs 4 memory states to win 



Open problems. We were not able to give the exact complexity of generalized 
reachability games where reachability sets have size 2. The memory approach, 
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showing that Adam has winning strategy of constant size, seems promising to- 
wards this question. 
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